Improving Temporal Coherence of Image Features by Clustering Technique Learned from Moving Images

نویسنده

  • Sainbayar Sukhbaatar
چکیده

Object recognition is difficult because the appearance of an object changes in many different ways. To recognize objects robustly, one needs representations that are constant despite those changes. Such invariant representations can be obtained by features with low sensitivity to various visual transformations. Spatial pooling is a widely used technique for extracting invariant features from images. When the same feature is extracted from different locations of images, activations from nearby locations can be clustered together (i.e., added together) to create invariance to small spatial shifting. However, the invariance produced by spatial pooling is limited to spatial shifts. In addition, spatial pooling can only be applicable to convolutional features because spatial pooling makes clusters of features by using their spatial topography only. In this thesis, we propose a novel pooling method, auto-pooling, that learns soft clustering of features from image sequences. Auto-pooling is trained to improve the temporal coherence of features, while keeping the information loss by pooling at minimum. Our method does not use spatial information, so it can be used with non-convolutional models too. Experiments on images extracted from natural videos showed that our method can cluster similar features together. When trained by convolutional features, the auto-pooling outperformed the spatial pooling on an image classification task, even though the autopooling does not use the spatial topology of features.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Face Detection Method Based on Over-complete Incoherent Dictionary Learning

In this paper, face detection problem is considered using the concepts of compressive sensing technique. This technique includes dictionary learning procedure and sparse coding method to represent the structural content of input images. In the proposed method, dictionaries are learned in such a way that the trained models have the least degree of coherence to each other. The novelty of the prop...

متن کامل

A PRACTICAL APPROACH TO REAL-TIME DYNAMIC BACKGROUND GENERATION BASED ON A TEMPORAL MEDIAN FILTER

In many computer vision applications, segmenting and extraction of moving objects in video sequences is an essential task. Background subtraction, by which each input image is subtracted from the reference image, has often been used for this purpose. In this paper, we offer a novel background-subtraction technique for real-time dynamic background generation using color images that are taken fro...

متن کامل

The Application of Multi-Layer Artificial Neural Networks in Speckle Reduction (Methodology)

Optical Coherence Tomography (OCT) uses the spatial and temporal coherence properties of optical waves backscattered from a tissue sample to form an image. An inherent characteristic of coherent imaging is the presence of speckle noise. In this study we use a new ensemble framework which is a combination of several Multi-Layer Perceptron (MLP) neural networks to denoise OCT images. The noise is...

متن کامل

Rice Classification and Quality Detection Based on Sparse Coding Technique

Classification of various rice types and determination of its quality is a major issue in the scientific and commercial fields associated with modern agriculture. In recent years, various image processing techniques are used to identify different types of agricultural products. There are also various color and texture-based features in order to achieve the desired results in this area. In this ...

متن کامل

Object Tracking Using Piecewise Feature Clustering

Object tracking is a complex, yet essential task to be addressed in any video surveillance application. Many real-time techniques proposed in the literature rely on a frame-to-frame matching of objects. This paper describes a technique which takes into consideration the inherent temporal coherence that exists across frames, thus being able to robustly perform tracking while handling difficult s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013